NSF PAR Search | NSF Public Access Repository

RayZer: A Self-supervised Large View Synthesis Model

Jiang, Hanwen; Tan, Hao; Wang, Peng; Jin, Haian; Zhao, Yue; Bi, Sai; Zhang, Kai; Luan, Fujun; Sunkavalli, Kalyan; Huang, Qixing; et al (October 2025, IEEE/CVF International Conference on Computer Vision)

Free, publicly-accessible full text available October 15, 2026
RandAR: Decoder-only Autoregressive Visual Generation in Random Orders

Pang, Ziqi; Zhang, Tianyuan; Luan, Fujun; Man, Yunze; Tan, Hao; Zhang, Kai; Freeman, William T; Wang, Yu-Xiong (June 2025, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))

Free, publicly-accessible full text available June 11, 2026
LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias

Jin, Haian; Jiang, Hanwen; Tan, Hao; Zhang, Kai; Bi, Sai; Zhang, Tianyuan; Luan, Fujun; Snavely, Noah; Xu, Zexiang (April 2025, International Conference on Learning Representations (ICLR))

We propose the Large View Synthesis Model (LVSM), a novel transformer-based approach for scalable and generalizable novel view synthesis from sparse-view inputs. We introduce two architectures: (1) an encoder-decoder LVSM, which encodes input image tokens into a fixed number of 1D latent tokens, functioning as a fully learned scene representation, and decodes novel-view images from them; and (2) a decoder-only LVSM, which directly maps input images to novel-view outputs, completely eliminating intermediate scene representations. Both models bypass the 3D inductive biases used in previous methods—from 3D representations (e.g., NeRF, 3DGS) to network designs (e.g., epipolar projections, plane sweeps)—addressing novel view synthesis with a fully data-driven approach. While the encoder-decoder model offers faster inference due to its independent latent representation, the decoder-only LVSM achieves superior quality, scalability, and zero-shot generalization, outperforming previous state-of-the-art methods by 1.5 to 3.5 dB PSNR. Comprehensive evaluations across multiple datasets demonstrate that both LVSM variants achieve state-of-the-art novel view synthesis quality. Notably, our models surpass all previous methods even with reduced computational resources (1-2 GPUs).
Free, publicly-accessible full text available April 24, 2026
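
To put the 1.5 to 3.5 dB PSNR gain quoted in the LVSM abstract in perspective, the sketch below converts a PSNR difference into the corresponding ratio of mean squared errors. It assumes the standard definition PSNR = 10 log10(MAX^2 / MSE) with the same peak value for both methods; the function names are illustrative and are not taken from the paper.

```python
import math


def psnr(mse: float, max_value: float = 1.0) -> float:
    """Peak signal-to-noise ratio in dB for a given mean squared error."""
    return 10.0 * math.log10(max_value ** 2 / mse)


def mse_ratio_for_gain(gain_db: float) -> float:
    """Ratio (new MSE / old MSE) implied by a PSNR gain of gain_db decibels.

    Assumes both PSNR values use the same peak value (hypothetical helper).
    """
    return 10.0 ** (-gain_db / 10.0)


if __name__ == "__main__":
    for gain in (1.5, 3.5):
        ratio = mse_ratio_for_gain(gain)
        print(f"+{gain} dB PSNR -> MSE scaled by {ratio:.3f}")
```

Under these assumptions, a gain of 1.5 dB corresponds to roughly 29% lower MSE and a gain of 3.5 dB to roughly 55% lower MSE.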
MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data

Jiang, Hanwen; Xu, Zexiang; Xie, Desai; Chen, Ziwen; Jin, Haian; Luan, Fujun; Shu, Zhixin; Zhang, Kai; Bi, Sai; Sun, Xin; et al (June 2025, IEEE/CVF International Conference on Computer Vision)

Free, publicly-accessible full text available June 1, 2026
Neural Gaffer: Relighting Any Object via Diffusion

Jin, Haian; Li, Yuan; Luan, Fujun; Xiangli, Yuanbo; Bi, Sai; Zhang, Kai; Xu, Zexiang; Sun, Jin; Snavely, Noah (December 2024, Conference on Neural Information Processing Systems (NeurIPS))

Full Text Available
Neural Directional Encoding for Efficient and Accurate View-Dependent Appearance Modeling

Wu, Liwen; Bi, Sai; Xu, Zexiang; Luan, Fujun; Zhang, Kai; Georgiev, Iliyan; Sunkavalli, Kalyan; Ramamoorthi, Ravi (June 2024, CVPR 24)

Full Text Available
PSDR-Room: Single Photo to Scene using Differentiable Rendering

https://doi.org/10.1145/3610548.3618165

Yan, Kai; Luan, Fujun; Hašan, Miloš; Groueix, Thibault; Deschaintre, Valentin; Zhao, Shuang (December 2023, ACM)

Full Text Available
NeuSample: Importance Sampling for Neural Materials

https://doi.org/10.1145/3588432.3591524

Xu, Bing; Wu, Liwen; Hašan, Miloš; Luan, Fujun; Georgiev, Iliyan; Xu, Zexiang; Ramamoorthi, Ravi (July 2023, ACM)

Full Text Available
Reconstructing Translucent Objects using Differentiable Rendering

https://doi.org/10.1145/3528233.3530714

Deng, Xi; Luan, Fujun; Walter, Bruce; Bala, Kavita; Marschner, Steve (July 2022, SIGGRAPH '22: ACM SIGGRAPH 2022 Conference Proceedings)

Inverse rendering is a powerful approach to modeling objects from photographs, and we extend previous techniques to handle translucent materials that exhibit subsurface scattering. Representing translucency using a heterogeneous bidirectional scattering-surface reflectance distribution function (BSSRDF), we extend the framework of path-space differentiable rendering to accommodate both surface and subsurface reflection. This introduces new types of paths requiring new methods for sampling moving discontinuities in material space that arise from visibility and moving geometry. We use this differentiable rendering method in an end-to-end approach that jointly recovers heterogeneous translucent materials (represented by a BSSRDF) and detailed geometry of an object (represented by a mesh) from a sparse set of measured 2D images in a coarse-to-fine framework incorporating Laplacian preconditioning for the geometry. To efficiently optimize our models in the presence of the Monte Carlo noise introduced by the BSSRDF integral, we introduce a dual-buffer method for evaluating the L2 image loss. This efficiently avoids potential bias in gradient estimation due to the correlation of estimates for image pixels and their derivatives and enables correct convergence of the optimizer even when using low sample counts in the renderer. We validate our derivatives by comparing against finite differences and demonstrate the effectiveness of our technique by comparing inverse-rendering performance with previous methods. We show superior reconstruction quality on a set of synthetic and real-world translucent objects as compared to previous methods that model only surface reflection.
Full Text Available
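
The abstract for the translucent-object paper mentions a dual-buffer method for the L2 image loss but does not spell out the formulation. The sketch below shows one plausible reading, stated as an assumption: the squared residual is replaced by the product of residuals from two independently sampled renders, so noise in one buffer is uncorrelated with noise in the other. All function names are hypothetical, and Gaussian noise stands in for Monte Carlo noise.

```python
import random


def single_buffer_loss(render, target):
    """Mean squared residual computed from one noisy render."""
    return sum((r - t) ** 2 for r, t in zip(render, target)) / len(target)


def dual_buffer_loss(render_a, render_b, target):
    """Mean product of residuals from two independent noisy renders.

    Assumed formulation: independent noise terms cancel in expectation,
    so the estimate is not inflated by the noise variance.
    """
    return sum((a - t) * (b - t)
               for a, b, t in zip(render_a, render_b, target)) / len(target)


def noisy_render(clean, sigma, rng):
    """Stand-in for a Monte Carlo render: clean image plus zero-mean noise."""
    return [c + rng.gauss(0.0, sigma) for c in clean]


def demo(num_pixels=20000, sigma=0.1, seed=0):
    rng = random.Random(seed)
    target = [rng.random() for _ in range(num_pixels)]
    clean = list(target)  # the noise-free render already matches the target
    render_a = noisy_render(clean, sigma, rng)
    render_b = noisy_render(clean, sigma, rng)
    return (single_buffer_loss(render_a, target),
            dual_buffer_loss(render_a, render_b, target))


if __name__ == "__main__":
    single, dual = demo()
    print(f"single-buffer loss: {single:.5f}")
    print(f"dual-buffer loss:   {dual:.5f}")
```

In this toy setup the true loss is zero. The single-buffer estimate settles near sigma**2 because the noise is squared, while the dual-buffer estimate stays near zero. The same independence argument applies to gradients, where each residual is multiplied by a derivative taken from the other buffer.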
IRON: Inverse Rendering by Optimizing Neural SDFs and Materials from Photometric Images

Zhang, Kai; Luan, Fujun; Li, Zhengqi; Snavely, Noah (June 2022, IEEE Conference on Computer Vision and Pattern Recognition)

We propose a neural inverse rendering pipeline called IRON that operates on photometric images and outputs high-quality 3D content in the format of triangle meshes and material textures readily deployable in existing graphics pipelines. Our method adopts neural representations for geometry as signed distance fields (SDFs) and materials during optimization to enjoy their flexibility and compactness, and features a hybrid optimization scheme for neural SDFs: first, optimize using a volumetric radiance field approach to recover correct topology, then optimize further using edge-aware physics-based surface rendering for geometry refinement and disentanglement of materials and lighting. In the second stage, we also draw inspiration from mesh-based differentiable rendering, and design a novel edge sampling algorithm for neural SDFs to further improve performance. We show that IRON achieves significantly better inverse rendering quality compared to prior works.
Full Text Available
